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^ Field of the Invention 

M= This invention relates to the field of transferring quality video data objects over a 

rM 5 packet switched network in such a way that the video is played in a smooth (not jerky) 
y manner. Further, this invention relates to the video distribution systems and methods for 

dynamically segmenting video data objects into multiple files to facilitate the transfer of the 

video data objects. 
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Description of Related Art 

In the past video streaming servers required that a file be fully present before the 
sever could start streaming the file. This imposed a considerable restriction as typical 
DVD or broadcast quality videos may be several Gigabytes in size and thus imposed a 
5 large latency before a viewer could start viewing a video. 

Video is the most dominant medium in entertainment and is rapidly becoming a 
critical part of computing as well. Video is often used in applications such as virtual reality 
gaming, for example, to mimic personal or virtual environments, increasing an 
3o application's appeal and usability. Video has a large information carrying capacity and is 
J heavily used in capturing and conveying complicated situations such as news events, live 
y interviews, scientific experiments, tourist attractions, and many others. 

With the increasing availability of high bandwidth networks, video on-demand 
JHs applications are gaining popularity on global digital communications networks such as the 
S Internet as well as private and corporate digital communication internal networks 

commonly referred to as Intranets. Example applications include online training, news 
broadcasts, educational programming, corporate information, and virtual seminars directly 
to every desktop computing system or workstation. Similarly, video kiosks can be set up 
20 in enterprises and university campuses to display live video and up-to-the-minute news, 
without ever needing an on-site upgrade. 



4 



IVOO-001.2 

Video files, however, occupy huge amounts of space on computers. It requires 
about 10MB to store one minute of video in most standard compression and 
decompression video formats, including Motion Picture Experts Group standard MPEG-1, 
the Apple Computer Inc. Indeo, Intel Corp. QuickTime, and Super Mac, Inc Cinepak. That 
5 translates into 1 .2GB of space for two hours of video, the length of an average feature 
film. These tremendous storage requirements make effective on-demand sharing of video 
files at least as important as conventional file sharing. 

However, conventional file servers do not address video's unique requirements and 
%) cannot effectively support video sharing. Full-motion video, inherited from analog TV, is a 
J sequence of images played out at constant intervals. The two most common analog video 
hi formats are the National Television Standards Committee (NTSC), used in the United 
i; States and Japan, and Phase Alternation Standard (PAL), used in Europe. NTSC plays 
^ video at 30 frames per second, while PAL plays it at 25 frames per second. The 
^5 sequence of images in a video clip must be relayed at a constant interval, or else the 
y perceptual quality degrades rapidly: the motion jumps and the sound breaks. This rigid 
periodic timing property is referred to as the isochronous requirement. Referring now to 
Fig. 1 , conventional file servers 10 are designed for minimal transfer latency. Files 15 are 
thus transferred to maintain the minimum latency and are transferred as quickly as 
20 possible. The files 15 will be interleaved with other digital communication traffic on the 
network and thus non-isochronously. Without explicit mechanisms to ensure isochronism, 
delivery rates are irregular, resulting in erratic playback quality at the client computing 
system 20. 
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To avoid erratic playback, the usual approach is to download whole files 15 from 
the server 10 to the client computing system 20 before starting video playback. This 
approach results in unacceptable delays for most video files, which are large. For 
5 example, even with transfer rates as fast as 1 .5Mb/second, the initial start-up delay is 60 
seconds for a one minute video clip. 

It is thus desirable to deliver video streams isochronously, as depicted in Figure 2, 
so that video playback is guaranteed to have smooth motion and sound. The file server 
10 must now transfer or stream the files 25 such that the time between each section of the 
file is transferred at a period of time x The even interval allows the file 25 to arrive 
isochronously with the first section to be displayed before any of the remaining sections of 
the file 25 have arrived at the client system 20, This allows a video clip to begin 
practically instantaneously. 

The rapid advances in the speeds of microprocessors, storage, and network 
hardware may give a false impression that video on-demand (VOD) solutions do not need 
special purpose video streaming software. Video streaming as shown in Fig. 2 allows 
efficient playback of full motion videos over networks with guaranteed quality using 
isochronous timing. 

When an operating system's default file transfer mode is used to stream a video 
file, faster hardware may accelerate the operating system's transfer rate, but this 
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improved hardware still cannot change the fundamental, erratic behavior of a file transfer 
as shown in Fig. 1 . By default, the file transfer process does not respect the isochronous 
nature of a video stream. This typically results in a jerky and poor-quality playback of a 
video stream. The dominant factors of a system's overall streaming performance are the 
5 higher level client/server and networking processes, and are not the raw power of the low 
level physical devices. 

U. S. Patent 6,101,546 (Hunt) describes a method and system for providing data 
files that are partitioned by delivery time and data type. A file is logically partitioned into 
%) data channels where each data channels holds a sequence of data of a particular data 
J type. The data channels are logically partitioned into delivery times. The format of the file 
W explicitly sets forth the synchronization between the data channels and the delivery times 
-t- of data held within the channels. The file format is especially well adapted for use in a 
t: distributed environment in which the file is to be transferred from a server to a client. 
[4 5 Channel handlers are provided at the client to process respective data channels in the file. 
U The channel handlers are data type specific in that they are constructed to process data of 
an associated data type. The data in the file may be rendered independently of the 
delivery time of the data. 

20 U. S. Patent 6,018,359 (Kermode, et al.) illustrates a system and method for 

multicast video-on-demand delivery system. The video-on-demand system divides video 
files into sequentially organized data segments for transmission and playback. Each 
segment is repeatedly transmitted in a looping fashion over a transmission channel. The 
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rate of transmission is equal to or greater than tfie playback rate, and the lengths of the 
segments are chosen such that: 

(i) the receiver tunes into no more than a fixed number of channels 
(preferably two) at any one time; 
5 (ii) the receiver tunes into a new channel only after an entire segment 

has been received from a previous channel; and 
(iii) until a maximum segment length is attained, data is received from no 
fewer than two channels. 
The segments are sequentially presented even as new segments are being downloaded. 
9o When the display rate is equal to the transmission rate, it is found that the foregoing 
J conditions are satisfied when the relative lengths of the segments form a modified 
y Fibonacci sequence. 

U. S. Patent 5,930,473 (Teng, et al.) discloses a video application server for 
^5 mediating live video services. The video application server is to be used in a network 

including source clients and viewer clients connected to one or more shared transmission 
media. A video server is connected to one of the transmission media and is operative to 
control the broadcast and storage of multiple live or previously stored video streams. The 
control may be provided via remote procedure call (RPC) commands transmitted between 
20 the server and the clients. In one embodiment, a video presentation system is provided in 
which a video stream from a source client is continuously broadcast to a number of viewer 
clients. One or more of the viewer clients may be authorized by the source client to 
broadcast an audio and/or video stream to the other clients receiving the source video 
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stream. In another embodiment, a multicast directory is provided to each of a plurality of 
viewer clients by transmitting directory infomnation in a packet corresponding to a 
predetennined multicast address. The multicast directory indicates to a particular viewer 
client which of a number of video programs are available for broadcast to that client. 

U. S. Patent 6,101,547 (Mukherjee, et al.) describes an inexpensive, scalable and 
open-architecture media server. The multi-media server provides client systems with 
streaming data requiring soft real-time guarantee and static data requiring a large amount 
of storage space. The servers use a pull-mode protocol to communicate with client 
systems through a real-time network. Separate data and control channels enhance the 
soft real-time capability of the server. The data channel conforms to an open standard 
protocol such as such as Transmission Control Protocol (TCP), User Datagram Protocol 
(UDP), or Real-time Transport Protocol (RTP). A switched data link layer for the control 
channel permits separate intrahost control messages that may be multicast and 
broadcast. The distributed file system selects a specific data block size based upon the 
compression technique employed to enhance soft real-time guarantee. A hierarchical 
data structure combined with merging empty data blocks minimizes disk fragmentation. 
Data blocks are striped across multiple disks to improve disk utilization. A local buffer and 
a queue for both read and write requests provides support for simultaneous read and write 
data streams. 

U. S. Patent 5,805,821 (Saxena, et al.) teaches a video optimized media streamer 
user interface employing non-blocking switching to achieve isochronous data transfers. 
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The media streamer includes at least one control node; a user interface having an output 
coupled to at least one control node; at least one storage node for storing a digital 
representation of at least one video presentation; and a plurality of communication nodes 
each having an input port for receiving a digital representation of at least one video 
5 presentation therefrom. The video presentation requires a time T to present in its entirety, 
and is stored as a plurality of N data blocks. Each data block stores data corresponding 
to a T/N period of the video presentation. Each communication nodes further has a 
plurality of output ports for outputting a digital representation. A circuit switch is 
connected between the at least one storage node and the input ports of communication 
So nodes for coupling one or more input ports to at least one storage node. The user 
J interface includes a capability for specifying commands for execution, and the at least one 
yj control node is responsive to individual ones of the commands for controlling at least one 
=p: of the at least one storage node and at least one of the plurality of communication nodes, 

in cooperation with the circuit switch, so as to execute a function associated with 
rid individual ones of the commands. The commands may include video cassette recorder- 
2 like commands that include commands selected from a group that includes a Load 
command, an Eject command, a Play command, a Slow command, a Fast Forward 
command, a Pause command, a Stop command, a Rewind command, and a Mute 
command. The commands may also include commands selected from a group that 
20 includes a Play List command, a Play Length command, and a Batch command. A 
synchronous application program interface (API) is provided for coupling, via the user 
interface, a user application program to at least one control node. The API includes 
Remote Procedure Call (RPC) procedures. 
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U. S. Patent 5,550,577 (Verbiest, et al.) illustrates a video on demand network, 
including a central video server and distributed video servers with random access 
read/write memories. The video on demand network transmits video signals to user 
5 stations pursuant to the receipt of control signals issued by these user stations. In order 
to optimize the retrieval costs, this video on demand network maintains a large video 
library in a central video server and stores locally popular video signals in a plurality of 
local distributed video servers from which the latter video signals are transmitted to the 
user stations. The video signals provided by the local distributed servers are updated 
90 from the central server based upon the changing popularity of the video signals. The video 
2 on demand network of Verbiest proposes in particular to store the video signals in the 
^ local distributed servers in random access read/write memories, e.g., electronic RAMs, 
i; magnetic or optical disks from which the video signals can flexibly be supplied on-line to 
H the user stations and to store the video signals in the central server in sequential access 
memories, e.g. Digital Audio Tapes (DAT) and CD-ROMs (CDR), providing cheap mass 
y storage. 

"Performance Evaluation of QuickVideo OnDemand (QVOD) Server," InfoValue 
Computing, Inc. Technical Report IV-TR-QVOD-1 999-07-1-1, July 8, 1999, InfoValue 
20 Computing, Inc., Elmsford, NY describes a video on-demand system developed for high 
performance, effective and flexible, network-based, on-demand sharing of videos. 
QuickVideo OnDemand provides streaming throughput for broadband applications 
Further, QuickVideo OnDemand allows a linearly scalable clustering mechanism which 
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provides support for higher throughputs, if required. QuickVideo OnDemand supports all 
video formats, codecs, networks and applications, and is compatible with any open 
application platform. 

5 "Network Video Computing Via QuickVideo Suite," InfoValue Technical White 

Paper, InfoValue Computing, Inc., Elmsford, NY, 1999, describes Network Video 
Computing the core of which is video streaming. Video streaming allows the efficient 
playing of full-motion video content over networks with guaranteed quality. The rigid 
timing property of full motion video is referred to as the isochronous timing. File servers 
3;0 are designed to minimize transfer latency during conventional network transfers, and are 
J insensitive to video's unique timing requirement. As a result, delivery rates are irregular 
bj and produce erratic playback as described above. Video streaming technologies are real- 
^ time network transfers that maintain the video's critical timing property throughout the 
entire delivery period, as depicted in Fig. 2. This white paper describes the an open 
pi 5 architecture with a streaming core. 

"Web Distribution Systems: Caching and Replication" Chandbok, Ohio State 
University, 1999, found http://www.cis.ohio-state.edu/-jain/cis788- 
99/web_caching/index.html, 8/15/00, provides an overview of the current techniques for 
20 caching and replication of digital data on computer systems interconnected through a 
global or local digital communication network. Refer now to Fig. 3 for a summary of 
caching in large distributed digital processing networks. Multiple server computing 
systems 100a, 100b, 100f are high performance computing systems such as the IBM 
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Corporation RS-6000-SP, The Sun Microsystems, Inc. Enterprise 10000 Server, the 
Hewlett-Packard Netserver AA-6200, or other server systems. The computer systems 
100a, 100b, 100f are each connected to multiple storage devices 105a, 105b, 
105r. The storage devices 105a, 105b, 105r are magnetic disk devices, compact disk 
read only memory (CD-ROM) "juke boxes," or tapes drives. A group of the server 
systems 100a, 100b, 100c or lOOd, lOOe, lOOf are respectively interconnected through 
the digital communications cluster network 110 and 115 to form the server cluster 1 120 
and the server cluster 2 125. The server cluster 1 120 and the server cluster 2 125 may 
be resident with in the same enterprise data center or placed at different geographical 
locations either within the enterprises or even in different enterprises. 

The cluster networks 110 and 115 are connected respectively to the network 
routers 130 and 135. The network routers 130 and 135 are further connected to a public 
or global digital communications network 155. The global network 155 may be the public 
Internet or an enterprise's private Intranet. 

The server computer systems 100a, 100b lOOf contain database information 

systems, storage for files such as audio or video files, and other data files to accessed by 
large numbers of people either publicly or privately within an enterprise through the client 
systems 150a, 150b, 150c. 

Edge servers 140a, 140b, 140c are connected to the global network 155 and thus 
provide access portals for the client systems 150a, 150b, 150c to the global network 155 
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to communicate with each other, with other edge servers 140a, 140b, 140c, or with the 
server computer systems 100a, 100b, lOOf Each edge servers 140a, 140b, 140c is 
connected has attached data storage device 145a, 145b, 145i. The attached data 
storage device 145a, 145b, 145iis generally a magnetic disk storage device, but may 
also include a CD-ROM, magnetic tape, or other storage media. 

If a server computer systems 100a, 100b, 100f has data 160 that is requested 
by many of the client systems 150a, 150b, 150c, the network traffic to the server 
computer system 100a may to great for either the global network 155 or the cluster 
network 110 to carry and maintain a reasonable quality of service. Quality of service in 
this context means that the original data 160 is transferred repetitively relatively quickly an 
if the original data 160 is audio or video files, that the isochronous nature of the transfer of 
the data is maintained. 

If the server clusters 1 20 and 1 25 are separated geographically, it may cost less to 
maintain the quality of service by placing a copy 165 of the original data 160 in a disk 1051 
on a second server system 100d. If the copy 165 of the original data 160 is permanent, It 
is referred to as being replicated. If the copy 165 of the original data 160 is temporary, it 
is referred to as cached. As the demand for the original data 160 is increased, it may be 
desirable to either replicate or cache 170 or 175 the data even within the disks 145b or 
145i of the edge servers 150a or 150c. 
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There are many policies developed regarding which of the original data 160 is 
replicated or cached 165, 170, or 175. Further, the replacement of cached data 165, 170, 
or 175 by other data that is demanded more often is known and generally follows a least 
recently used protocol, where the cached data 165, 170, or 175 that has not been 
5 requested Is replaced by that is more requested. 

U. S. Patent 6,088,721 (Lin, et al.) teaches an efficient unified replication and 
caching protocol. The protocol provides assurance of consistent replication of objects 
from a central server to caching servers, for example, over data communication networks 
^0 such as the Internet. It is an application-layer protocol, which guarantees delivery of 
=f objects such as files. This protocol insures that objects sent by a source machine such as 
fV| a server to any number of destination machines such as caching servers actually arrive at 
i the intended caching servers even when the caching servers are temporarily unavailable, 
H- for example, due to failure or network partition. 

E U. S. Patent 6,061 ,504 (Tzelnic, et al.) illustrates a video file server using an 

integrated cached disk array and stream server computers. The video file server includes 
an integrated cached disk array storage subsystem and a multiple stream server 
computers linking the cached disk storage system to the data network for the transfer of 
20 video data streams. The video file server further includes a controller server for applying 
an admission control policy to client requests and assigning stream servers to service the 
client requests. The stream servers include a real-time scheduler for scheduling 
isochronous tasks, and supports at least one industry standard network file access 
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protocol such as Simple Network Management Protocol (SNMP) and one file access 
protocol Network File System (NFS) for continuous media file access. The cached disk 
storage subsystem is responsive to video prefetch commands, and the data specified for 
a prefetch command for a process are retained in an allocated portion of the cache 
5 memory from the time that the cached disk storage subsystem has responded to the 

prefetch command to the time that the cached disk storage subsystem responds to a fetch 
command specifying the data for the process. The time between prefetching and fetching 
is selected based on available disk and cache resources. The video file server provides 
video-on-demand service by maintaining and dynamically allocating sliding windows of 
^ video data in the random access memories of the stream server computers. 

y "Network Caching Guide," Goulde, Patricia Seybold Group for Inktomi Corp., 

4^ Boston, Ma, March 1999, describes the various types of caching approaches and the 
^ different ways for caches to be implemented. Implementations vary depending on where 
Vi5 the cache is placed, who is accessing the cache, and the quantity and type of content that 
S is being cached. Goulde describes the Inktomi Traffic Server from Inktomi Corporation. 
The Inktomi Traffic Server is capable of delivering fresh content to large numbers of users 
around the world from a large number of Web servers around the world. 

20 "Inktomi Traffic Server - Media Cache Option", Inktomi Corporation, San Mateo 

Ca., 1999, found http://www.inktomi.com, 8/15/00, describes the caching option for the 
Inktomi Traffic Server to support streaming of video data files. 
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"Implementing Multiplexing, Streaming, and Server Interaction for MPEG-4" Kalva 
et al., IEEE Transactions On Circuits And Systems For Video Technology, Vol. 9, No. 8, 
December 1999, pp. 1299-1312, describes the implementation of a streaming client- 
server system for object-based audio-visual presentations in general and l\/IPEG-4 content 
in particular. The system augments the MPEG-4 demonstration software implementation 
(IM1) for PC's by adding network-based operation with full support for the Delivery 
Multimedia Integration Framework (DMIF) specification, a streaming PC-based server with 
DMIF support, and multiplexing software. The MPEG-4 server is designed for delivering 
object-based audio-visual presentations. The system also implements an architecture for 
client-server interaction in object-based audio-visual presentations, using the mechanism 
of command routes and command descriptors. 

"New Solution for Transparent Web Caching: Traffic Server 2.1 Supports WCCP," 
Inktomi Corporation, San Mateo Ca., 2000, found 

http://www.inktomi.com/products/network/traffic/tech/wccp, 8/15/00 describes the use of 
the Web Cache Control Protocol (WCCP) from Cisco Systems, Inc. within Inktomi 
Corporation's Traffic Server. 

"API Overview," Inktomi Corporation, San Mateo Ca., 2000, found 
http://www.inktomi.com/products/network/traffic/tech/wccp, 8/15/00, describes the 
application program interface tools that are available for the Inktomi Corporation's Traffic 
Sen/er which allow customization or the Traffic Server's event processing thus allowing 
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manipulation of hypertext transaction protocol (HTTP) transactions at any point in their 
lifetime. 

"Web Cache Communication Protocol v2" Cisco Systems, Inc., San Jose, CA, 
found http://www.cisco.com/univercd/cc/td/doc/product/software/ios1 20/ 
120newft/120t/120t3/wccp.htm, 8/15/00, describes the protocol that allows the use a 
Cisco Cache Engine to handle web traffic, reducing transmission costs and downloading 
time. This traffic includes user requests to view pages and graphics on World Wide Web 
servers, whether internal or external to a network, and the replies to those requests. When 
a user requests a page from a web server (located in the Internet), the router sends the 
request to a cache engine. If the cache engine has a copy of the requested page in 
storage, the cache engine sends the user that page. Otherwise, the cache engine 
retrieves the requested page and the objects on that page from the web server, stores a 
copy of the page and its objects, and forwards the page and objects to the user. WCCP 
transparently redirects Hypertext Transfer Protocol (HTTP) requests from the intended 
server to a cache engine. 

"A Practical Methodology For Guaranteeing Quality Of Service For Video-On- 
Demand," Zamora et al., IEEE Transactions On Circuits And Systems For Video 
Technology . Vol. 10, No. 1, February 2000, describes an approach for defining end-to-end 
quality of service (QoS) in video-on-demand (VoD) services. A schedulable region for a 
video server, which guarantees end-to-end QoS, where a specific QoS required in the 
video client, translates into a QoS specification for the video server. The methodology is 
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based on a generic model for VoD services, which is extendible to any VoD system. In 
this kind of system, both the network and the video server are potential sources of QoS 
degradation. The effects that impairments in the video server and video client have on 
the video quality perceived by the end user is examined. 

As described above, video files may be very large, on the order of 1 .2GB for a two- 
hour movie or video presentation. In the digital communication networks 110, 115, and 
155 of Fig. 3, the files are generally formed into data packets for transfer. These data 
packets may not arrive to a designated client system 150a, 150b,and 150c in correct 
^0 order for processing. This requires reception of the complete file before processing may 
2 begin. If the file is an audio or video file requiring isochronous presentation of the file, the 
y files must be totally received before processing or the files must be segmented or 
i partitioned into portions to allow smaller units of the files to be processed. 

raj 5 U.S. Patent 5,926,649 (Ma, et al.) teaches a Media server for storage and retrieval 

g of voluminous multimedia data. The Media server provides storage and retrieval of 
multiple data streams in a multimedia distribution system. A given data stream is 
separated into a plurality of portions, and the portions are stored in a multi-disk storage 
system with Y disks each having X zones such that the ith portion of the given stream is 
20 stored in zone (i mod X) of disk (i mod Y). The number X of zones per disk and the 
number Y of disks are selected as relatively prime numbers. The stored data are 
retrieved using Y independent retrieval schedulers, which are circulated among the Y 
disks over a number of scheduling intervals. Each retrieval scheduler processes multiple 
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requests separated into X groups, with the requests of each group accessing the same 
disk zone during a given scheduling interval. The retrieval schedulers are also configured 
such that the retrieval requests of a given retrieval scheduler access the same disk during 
a given scheduling interval. The data stream placement technique in conjunction with the 
5 retrieval schedulers provide sequential-like parallel retrieval suitable for supporting real- 
time multimedia data distribution for large numbers of clients. 

U. S. Patent 5,936,659 (Viswanathan, et al.) illustrates a method for broadcasting 
movies within channels of a wide band network by breaking the communications path into 
^0 a number of logical channels and breaking each movie up into a number of segments of 
J increasing size. The first segment of each movie is the smallest segment is transmitted in 
M sequence over the first logical channel and repeated. The second segment of each 
4^ movie, which is proportionately larger than the first segment of each movie, is transmitted 
^ in sequence over the second logical channel and repeated. This is repeated for the total 
[4 5 number of segments which equals the total number of logical channels. The segments 
S are broadcast in such a way that, once the first segment is received at a client location, 
the subsequent segments are also received in time, so that the movie can be viewed 
continuously. 

20 U. S. Patent 5,973,679 (Abbott, et al.) describes an indexing method for allowing a 

viewer to control the mode of delivery of program material. By mapping from time to data 
position, data delivery can begin at any selected time in the program material. The 
indexing method also provides for controlling data delivery to begin at the beginning of a 
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frame of data. A synchronizing method is provided to minimize a time offset between 
audio and video data, particularly in environments using groups of pictures. 

U. S. Patent 5,996,015 (Day, et al.) November 30, 1999 describes a method of 
delivering seamless and continuous presentation of multimedia data files to a target 
device by assembling and concatenating multimedia segments in memory. The provides 
a multimedia server connected in a network configuration with client computer systems. 
The multimedia server further includes various functional units which are selectively 
operable for delivering and effecting the presentation of multimedia files to the client such 
that a plurality of multimedia files are seamlessly concatenated on the fly to enable a 
continuous and uninterrupted presentation to the client, in one example, client selected 
video files are seamlessly joined together at the server just prior to file delivery from the 
server. The methodology includes the analog to digital encoding of multimedia segments 
followed by a commonization processing to ensure that all of the multimedia segments 
have common operating characteristics. A seamless sequential playlist or dynamically 
created playlist is assembled from the selected and commonized segments and the 
resources needed to deliver and play the playlist are reserved in advance to assure 
resource availability for continuous transmission and execution of the playlist. At a 
predetermined point prior to an end point of each selected multimedia segment, the next 
selected segment is initialized and aligned in memory in preparation for a seamless switch 
to the next segment at the end of a previous segment, thereby providing a seamless flow 
of data and a continuous presentation of a plurality of selected multimedia files to a client 
system. 
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U. S. Patent 5,608,448 (Smoral, et al.) describes a hybrid architecture for a video 
on demand server. The processing requirement at each computing element in a video 
server for a video on demand (VOD) system is reduced to only those needed for VOD, 
resulting in a less expensive processor with less memory and, hence, lower cost. A 
hybrid video server architecture combines the features of massive parallel processor 
(MPP) and workstation designs. Since it is not necessary to run a parallel relational 
database program in order to accomplish VOD data distribution, a unique type of switch 
element that is well matched to the VOD server problem is employed. By matching this 
switch element technology to an appropriate data storage technique, a full featured, 
responsive VOD server is realized. 

U. S. Patent 6,061,732 (Korst, et al.) describes a data streaming system utilizing an 
asynchronous technique for retrieving data from a stream server. In an audio/video server 
blocks of data are read from a storage medium by a reader and supplied to users in the 
form of data streams. The storage medium comprises a plurality of record-carrier based 
storage units. A reader reads a batch of data units from a storage unit in a single relative 
movement of a reading head of the storage unit with respect to the record-carrier of the 
storage unit. A scheduler controls reading of blocks from the storage medium by 
detennining from which storage unit(s) data unit(s) need to be read for the block and 
placing a corresponding carrier access request in a read queue. The scheduler extracts 
for each of the storage units a batch of carrier access requests from the queue and issues 
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the batch to the reader in an asynchronous manner, in response to the reader having 
substantially completed reading data units for a previous batch for the storage unit. 

U. S. Patent 5,414,455 (Hooper, et al.) teaches a segmented video on demand 
system. In the system for distributing videos, multiple videos are stored on a mass 
storage device. Each video includes a plurality of frames of digitized video data for 
playback on a viewing device. The system includes a memory buffer for storing a 
segment of a selected one of the videos. The segment includes a predetermined number 
of frames representing a predetermined time interval of the selected video. In addition, 
the memory buffer including a write pointer and a read pointer. Software controlled 
servers are provided for witting and reading video data of the selected video to and from 
the memory buffer, independently, at locations indicated by the write and read pointers to 
transfer the selected video to the viewing device. 

Summary of the Invention 

An object of this invention is to provide a method and apparatus to dynamically 
segment video data files. 

Further, another object of this invention is to provide a method and apparatus to 
segment video data files to facilitate the transfer of the video data files or portions of video 
data files from a file server system to client computing system. 
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To accomplish these and other objects a data service system is in communication 
with multiple computing systems to provide at least one data file of many data files to at 
least one of the computing systems. The data sen/ice system has multiple data file 
storage devices in communication with each other and with any of the computing systems. 
Further the data service system has a segmentation apparatus in communication with the 
plurality of data file storage devices to dynamically fragment any for the data files or 
portion of the data files into a plurality of segments to allow transfer to and processing by 
at least one of the computing systems. The segmentation apparatus dynamically 
fragments each data file or portion of a data file as a function of demand for the data files, 
size of each data file of the plurality of data files, amount of retention space available on 
each of the plurality of data storage devices, and available bandwidth for communication 
with the plurality of computing systems. 

The segmentation apparatus interrogates a disk usage table for an identifier of the 
requested digital data file or portion of the digital data file and a range of locations within 
the multiple data storage devices where the digital data file is resident. The segmentation 
apparatus then calculates a new segment size list for the digital data file or portion of the 
digital data file describing a fragmentation of the digital data file as a function of demand 
for all digital data files resident on the digital data storage devices, size of each digital data 
file of all digital data files, amount of retention space available on each of the plurality of 
digital data storage devices, and available bandwidth for communication with the 
computing systems. 
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If the digital data file has been previously segmented, the new segment size list is 
compared to an existing segment size list. If the existing segment size list allows the 
digital data file to be more easily transferred, transferring the digital data file to the 
computing system according to the existing segment size list. However, if the new 
segment size list allows the digital data file to be more easily transferred, a new file 
identifier for each new segment ascertained by the new segment size list and a new range 
of locations for each new segment of the digital data file to identify the location for each 
new segment is created by the segmentation apparatus. The newly re-segmented digital 
data file is stored at the locations within the data storage devices allocated for each new 
segment. Each new segment of the digital data file is transferred to the requesting 
computing systems. Each request of a digital data file is evaluated for re-segmenting and 
re-segmented if the demand for the digital data file or portion of the digital data file, or the 
loading of the data storage device containing the existing segments of the data file 
indicate re-segmenting is required. 

The segmentation apparatus creates the segment size list for a data file or a 
portion of a data file by first determining a number of storage devices available to retain 
the segments of the data file. A maximum data transfer load for the storage devices is 
then determined. A minimum segment size, which is the smallest amount of data to be 
contained within one segment of the data file, is then assigned. A first segment size is 
calculated as a first function of a number of the storage devices, the current data transfer 
load, the maximum data transfer load, and the minimum segment size. The size of a last 
segment is assigned as the minimum segment size. The remaining segment sizes are 
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calculated as a second function of the number of the storage devices, the current data 
transfer load, the maximum data transfer load, and the minimum segment size. The data 
file is then partitioned into segments such that the first segment of the data file is of the 
first segment size, the last segment of the data file is of the last segment size, and all the 
5 remaining segments of the data file is of the remaining segment sizes. Once the data file 
has been segmented, the storage device that is to retain each segment of the data file are 
assigned and the address within the storage devices to identify the location of an 
assigned segment is assigned. Further, a name for each segment of the data file is 
generated by the segmentation apparatus. 



The first function to determine the first segment size is found by the formula: 



Segl 



min(SegSize^,„,V/f) 



where 



Seg1 is the first segment size, 



^5 



min is the minimum function of two variables, 



SegSizemin is the minimum segment size allowed during the 



fragmenting of the video data file, The minimum segment size 



is empirically determined and is usually about 5 second of the 



playing time of the video data file. 



20 



V is a total size of the digital data file, and 



f is determined by the formula: 




where 
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Nd is the number of storage devices available to 

retain the segnnents of the digital data file, 

Mi is the maximum digital data transfer load for 

the data storage devices, and 

C| is the current digital data transfer load of the 

data storage devices. 

The second function to determine the remaining segment sizes is found by the 
formula: 

Segn - max(SegSize^i„, V/f) 

where 

Segn is the a segment size for one segment of the remaining 
segments, 

max is the maximum function of two variables, 

SegSizemin is the minimum segment size allowed during the 

fragmenting of the video data file, 

V is a total size of the digital data file, and 

f is determined by the formula: 

where 

Nd is the number of storage devices available to 

retain the segments of the digital data file, 

Mi is the maximum digital data transfer load, and 

27 



IVOO-001.2 



C| is the current digital data transfer load, 



The segmentation apparatus further determines a file usage factor describing a 



number of requests for the data file for a period of time and a file interactivity factor 



describing a number of jumps by the computing system within the data file. The first and 



second functions are dependent upon the file usage factor and/or the file interactivity 



factor. The first function to determine the first segment size is now found by the formula : 



Segl = min(SegSizei^j„, V/f) 



where 



Seg1 is the first segment size, 

min is the minimum function of two variables, 

SegSizOmin is the minimum segment size allowed during the 

fragmenting of the video data file, 

V is a total size of the digital data file, and 

f is determined by the formula: 



Nd is the number of storage devices available to 
retain the segments of the digital data file, 
Ml is the maximum digital data transfer load, 
C| is the current digital data transfer load, 
H is the file usage factor, and 
I is the file Interactivity factor. 
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The second function to determine the remaining segment sizes is found by the 



formula 



Segn - max(SegSize;„i„, V/f ) 



where 



Segn is the a segment size for one segment of the remaining 
segments, 

max is the maximum function of two variables, 

SegSizemin is the minimum segment size allowed during the 

fragmenting of the video data file, 

V is a total size of the digital data file, and 

f is determined by the formula: 



Nd is the number of storage devices available to 
retain the segments of the digital data file, 
Ml is the maximum digital data transfer load, 
Ci is the current digital data transfer load, 
H is the file usage factor, and 
I is the file Interactivity factor. 




where 



If either the file usage factor or the interactivity factor are not considered in the 



above formula, their values are set to zero. 
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In the data service system of this invention, the data file is a video data file to be 
transferred isochronously to the connputing system. 

Brief Description of the Drawings 

Fig. 1 is a diagram of the transfer of files on a digital communications network of 
the prior art requiring minimal latency. 

Fig. 2 is a diagram of the transfer of files on digital communications network of the 
prior art illustrating isochronous file transfer. 

Fig. 3 is a diagram of a distributed computer network system illustrating replication 
of files in caches of the prior art. 

Fig. 4 is a diagram of a distributed computer network system illustrating 
segmentation of files of this invention. 

Fig. 5 is a flow diagram illustrating the method of segmentation of files of this 
invention. 
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Detailed Description of the Invention 

Refer now to Fig. 4 for a description of a video distribution system of this 
invention. The client computing systems 400a, 400b, 400c are connected through a 
communications link to an edge server 405a, 405b, and 405c. Each edge server 405a, 
405b, 405c acts as an interface for the client computing systems 400a, 400b, 400c to a 
global communications network 415. The edge servers 405a, 405b, 405c are at the 
boundary between the "front-end" and the "backend" of the video distribution system. The 
front-end being the client computing systems 400a, 400b, 400c that are the terminal 
points whereby the users can access the video distribution system. Further the edge 
servers 405a, 405b, 405c are generally internet service providers to which the client 
computing systems 400a, 400b, 400c are in communication. 

The backend of the video distribution system has server systems 420a, 420f 
that are grouped together to form server clusters 410a, 410b. The server systems 
420a, 420b, and 420c are interconnected together through the cluster network 455. The 
server systems 420d, 420e, and 420f are interconnected together through the cluster 
network 460. The router 425 provides an interface for the server cluster 1 410a to the 
global communication network 415. Likewise, the router 430 provides an interface for the 
server cluster n 410b to the global communication network 415. 

The gateway server 475 is connected through the global communication network 
415 to the edge servers 405a, 405b, 405c and thus to the client computing systems 400a, 
400b, 400c, The gateway server 475 is the central point of contact for incoming requests 
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to the system from the client computing systems 400a, 400b, and 400c. When a client 
computing systems 400a, 400b, 400c requests a video data file (on demand) or join a 
broadcast (multicast) of a video data file, it first contacts the gateway server 475. The 
gateway server 475 maintains an updated list of the server systems 420a, 420f in the 
5 system. Based on the location of the client computing systems 400a, 400b, 400c and the 
type of request, it routes the request to the appropriate server systems 420a, 420f. 

A large-scale system containing thousands of video data files must offer an efficient 
and easy to use content management service to the client computing systems 400a, 
%) 400b, 400c. Such a content management sen/ice includes capabilities to add/delete, 
J categorize, and browse video data files and is provided by the title server 450. In 
2 presence of a dedicated title server 450, the gateway server 475 redirects the client 
j:. computing systems 400a, 400b, 400c requests to the title server 450. In absence of such 
a dedicated title server 450, the gateway server 475 can be configured to provide content 
[^15 management services to client computing systems 400a, 400b, and 400c. Client 
y computing systems 400a, 400b, 400c, then, browse video data file in the gateway server. 

In a geographically distributed broadband a video distribution system of this 
invention, there will be multiple title servers 450, each for a service region. The gateway 
20 server 475 will route the client computing systems 400a, 400b, 400c requests to 

appropriate title servers 450 based on the location of the client computing systems 400a, 
400b, 400c. 
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A distribution server 470 is used to introduce new contents in the a video 
distribution system of this invention. Once a new video data file is available, a media 
distributor uses this service to propagate the title to different service regions of a 
geographically distributed system. The distribution server 470 consists of four distinct 
5 components. A Distribution Center, which is a remote service, is used by media 
distributors to push new video data files to regional server systems 420a, 420f A 
Distributor Console, a web based remote graphical user interface (GUI), is used to specify 
locations and contents to be pushed to remote server systems 420a, 420f. A set of 
Asset Managers, which are local to regional server systems 420a, 420f, is responsible 
^ for managing and tracking contents in the regional server systems 420a, 420f. A set 
2 of asset databases, one database per regional server system 420a, 420f, which stores 
2 the meta data for the available contents (video data files) in that regional server systems 
£ 420a, 420f Asset managers use this database to keep track of local video data files. 
Multiple asset managers can share one asset database. The title server 450 also uses 
this database to generate a categorized, browsable list of video data files. 

A media distributor uses the distributor console to schedule distribution of new 
media data objects (video data files) to the a video distribution system of this invention. 
The new video data files generally reside in a tertiary storage 445 such as a robotic DVD. 
20 The media distributor specifies when to push the title, the list of target regional sites, and 
the textual meta data related to the video. Among other things, the meta data of a title will 
possibly contain information required to categorize it as well as a set of searchable 
strings, which can be used to search the content of the video data files. The distributor 
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console connects with the remote distribution center 470 and delivers the schedule. The 
distributor console contacts the asset managers in the specified target server systems 
420a, 420f, and schedules the delivery of the new content Once an server systems 
420a, 420t receives the new video data file, it first stores the content in any available 
5 space in a local disk 480a, 480r. Then, it updates the asset database with the 

information on the new video data file (including the received meta data on the video data 
file). If it does not have any available space, it replaces an old video data file using a 
programmed policy. 

%) Based on the client computing systems 400a, 400b, 400c request (browsing by 

i5 category, or searching using a string), the title server 450 queries the asset database, and 
y creates a list of video data files for the client computing systems 400a, 400b, 400c to 

browse. The title server 450 uses aggressive caching techniques to improve the 
H performance of the query. When new information is added in the asset database, the 
^5 cache in the title server 450 is invalidated. 

It is sometimes possible for a title server 450 to have information on a video data 
file, which is not wholly available in the local storage 480a, 480r, for various reasons. 
Portions of the video data file may have been replaced because the asset manager 
20 needed space for a new video data file, or only a portion of a video data file was 

propagated from the distribution center. Once a client computing systems 400a, 400b, 
400c requests such a video data file, server system 420a, 420f, is fetches the video 
data file to the local storage 480a, 480r: The server system 420a, 420f allocates 
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free space in the local storage 480a, 480r possibly by replacing a portion of a resident 
video data file. The server system 420a, . . . , 420f contacts the distribution server 470 
providing the name of the video data file and the remaining portion of the video data file. 
Once the distribution server 470 is ready, the server system 420a, 420f fetches the 
5 remaining portion of the video data file, stores it in the allocated free space, and updates 
the asset database. 

Once user of a client computing systems 400a, 400b, 400c selects a video data file 
to be viewed, it contacts the admission server 435, which based on the bandwidth 
9o requirements and the file location of the video data file, assigns a video server system 
^ 420a, 420f from the server clusters 410a, 410b. 

The admission server 435 provides a set of mechanisms, which are used to 
H implement different policies for load balancing. The admission server 435 maintains a 
^5 cluster topology, a disk usage table, a node usage table, and a cluster map. The cluster 
topology maintains the connection information of the cluster. It itemizes a list of server 
systems 420a, 420f of a server cluster 410a, 410b, which can access any of the disks 
480a, 480r . The cluster topology contains the server system 420a, 420f 
identification that is the mount point where a disk 480a, 480r is mounted, and the 
20 access status of the disk 480a, 480 r. 

The disk usage table maintains the capacity (maximum data rate in Mbps) and the 
current load (data rate in Mbps) for each disk 480a, 480r in the server cluster 410a, 
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410b. The node usage table maintains the streaming capacity (maximum data rate in 
Mbps) and the current load for each node in the server cluster 410a, 410b. The cluster 
map contains maintains an up to date list of network address (internet protocol address), 
port and the status of the important server system 420a, 420f in the distribution 
system, and it maintains a list of server systems 420a, 420f in the cluster 410a, 410b, 
their network addresses and their status. A server systems 420a, 420f can be in one 
of two states: Live (L) and Failed (D). Additionally, the admission server 435 maintains a 
supporting data structure, required to provide fault tolerance and authenticated access to 
the server cluster 410a, 410b. The data structure maintains a table containing the list of 
active sessions per server system 420a, 420f, and a similar table for active sessions 
per disk 480a, 480r. 

The configuration server 485 allows an administrator to define and to configure 
server clusters 410a, 410b and the distributed server installations. It maintains an up to 
date information of the distributed installation using a periodic monitoring mechanism and 
asynchronous update events from the servers 420a, 420f in the system. 

As described, the video data files may be several gigabytes in size. In order to 
facilitate the transfer of the video data files to client computing systems 400a, 400b, 400c 
for viewing by a user, it is desirable to fragment the video data file into smaller segments. 
Each segment is assigned a file name and a location within any of the disks 480a, 
480r. and 495a, 495x. When a client computing system 400a, 400b, 400c requests a 
video data file, the admission server 435 retrieves the listing of the segments of the 
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requested data file from the disk usage table. It should be noted, that the requested video 
data file may in fact be any portion of a larger video data file not just the whole video data 
file. It should further be noted that the portion of the video data file requested may not 
encompass whole segments by may also contain fractional segments. 

5 

Refer now to Fig. 5 to discuss the dynamic segmentation of the this invention that 
allows the segmentation of the a video data file or a portion of a video data file to be 
segmented or re-segmented according to the activity of the disks 480a, 480r, and 
495a, 495x and the request patterns for the video data file by the client computing 
9o system 400a, 400b, 400c. The request 500 for a video data file (or portion of a video data 
J file) has an identification (file name) of the requested video data file and a range or 
yj indication of the beginning location and size of the requested video data file. As described 
* above, the admission server 435 retrieves the disk usage table and calculates a new 
!r; segmentation list. The new segmentation list and the current segmentation list from the 
[;il 5 disk usage table are compared 520 to determine if the requested video data file should be 
S re-segmented. 

Further, the loading of the disks 480a, .... 480r, and 495a, 495x is compared 
520 to a threshold value. If the requested video data file does not require segmenting, the 
20 admission server 435 transmits 560 a service request to the appropriate server system 

420a 420f. The server systems 420a, 420f containing the segments of the 

requested video data file begin to stream the segments of the video data file to the client 
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computing system 400a, 400b, 400c. The admission server 435 then updates the hit 
counter 560 within the disk usage table indicating the usage of a video data file. 

If all segments of the requested video data file are not present or the newly 
5 calculated segment size listing indicates a re-segmenting of the requested video data file 
is required or the disk loading also indicates that the requested video data file needs to be 
re-segmented according to the new segment listing, the admission server 435 generates 
530 new video data file names or identification for each new segment. The admission 
server 435 then generates 540 the disk locations for each new segment of the requested 
y 0 video data file. The segments are then copied from the old segments present on the 

server systems 420a, 420f or retrieved from the backing storage unit 445 of the 
hi distribution server 470. 

H= The segment size of a requested video data file of a requested portion of a video 

5 data file is a function of demand for all video data files resident on the disks 480a, 
^; 480r, and 495a, 495x on which the requested video data file resides, size of the 
requested video data file, amount of storage space remaining on each disk 480a, 
480r, and 495a, 495x, and available bandwidth for communication from the disks 
480a, 480r, and 495a, 495x to the client computing systems 4Q0a, 400b, 400c 
20 requesting the video data file. 

The placement of each segment is determined by the disk loading of the disks 
480a, 480r, and 495a, 495x, the relative geographical location of those client 
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computing systems 400a, 400b, 400c requesting the video data file. Tha initial request for 
the video data file may have generated a primary segmented video data file 490a. The 
primary segmented video data file 490a is distributed in this case over ■::he disks 480a, 
480b, and 480c. 

If the loading of the disks 480a, 480b, and 480c increased above the threshold or 
the demand from the client computing systems 400a increases, a copy of segment x is 
placed at the edge server 405a. Further, if the loading of the disks 495a, 495b, and 495c 
indicates such, the segment x can be dynamically re-segmented to segments x.a 490b, 
x.b 490c, and x.c 490d. In this case, segments x.a 490b, x.b 490c, and x.c 490d are 
disbursed over the disks 495a, 495b, and 495c. 

The requests for the segment x from the client computing system 400c indicate 
that the segement x should be re-segmented into four segments x.d 4S0e, x.e 490f, x.f 
490g, and x.g 490h. In this case, the disk loading for the disk 495g is sufficient to allow 
all four sub-segments segments x.d 490e, x.e 490f, x.f 490g, and x.g 490h to reside on 
the single disk 495g. 

It is apparent that there can be not only multiple copies of a video data file within 
the video distribution system of this invention, but multiple copies of the segments of the 
video data file that are further divided into sub-segments as the reques'is video data files 
or portions of video data files indicate that new segment sizes are reauired. 
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Refer now to Fig. 6 for a discussion of the calculation 510 of Fig. 5 of a new 
segment listing of a video data file or a portion of a video data file. The segmentation 
process begins once a client computing system 400a, 400b, 400c has contacted the 
admission server to request a video data file. 

The method begins by initiating 600 a segment counter to indicate a first segment 
of the video data file. A file remaining indicator denotes the amount of the video data file 
remaining to be segmented. The file remaining indicator is set 605 to the total file size of 
the requested video data file. 

The segment counter is then tested 610 to indicate whether the first segment is 
being created. During the creation of the first segment, the' first segment size is 
determined 615 by the formula: 

Seg1 = min(SegSize„,„V/f) 

where 

Segl is the first segment size, 

min is the minimum function of two variables, 

SegSizGmin is the minimum segment size allowed during the 

fragmenting of the video data file, The minimum segment size 

is empirically determined and is usually about 5 second of the 

playing time of the video data file. 

V is a total size of the digital data file, and 

f is determined by the formula: 
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where 

Nd is the number of storage devices available to 
retain the segments of the digital data file, 
Ml is the maximum digital data transfer load, 
C| is the current digital data transfer load, 
H is the file usage factor (to be discussed 
hereinafter), and 

I is the file Interactivity factor(to be discussed 
hereinafter). 

The file remaining indicator is decremented 620 by the size of the first segment 
size and the segment counter is incremented 625 to the next segment to have its size 
determined. 



The file remaining indicator less the minimum segment size is compared 630 to the 
minimum segment size. If the file remaining indicator less the minimum segment size is 
not less than the minimum segment size, the next segment is now determined by the 
formula: 

Segn- max(SegSize^j„, V/f) 

where 

Segn is the a segment size for one segment of the remaining 
segments, 
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max is the maximum function of two variables, 

SegSizemin is the minimum segment size allowed during the 

fragmenting of the video data file, 

V is a total size of the digital data file, and 

f is determined by the formula: 



Nd is the number of storage devices available to 
retain the segments of the digital data file, 
Mi is the maximum digital data transfer load, 
C) is the current digital data transfer load, 
H is the file usage factor (to be discussed 
hereinafter), and 

I is the file Interactivity factorfto be discussed 
hereinafter). 




where 



The file remaining indicator is now decremented 620 by the new segment size and 



the segment counter is incremented 625 to the next segment for which the size is to be 



determined. 



This determining of the subsequent segment size continues until the size of the file 



remaining less the minimum segment size is less than the minimum segment size. At this 
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point, the next to the last segment (n-1) is set 640 to the size of the file remaining less the 
minimum segment size. The last segment is set 645 to the minimum segment size. 

The admission server 435 then assigns file names to each segment described In 
5 the segment listing and allocates locations within the disks 480a, 480r of the server 
systems 420a, 420f. The segmented video data file 490a is transferred from the 
distribution server 470 to the server systems 420a to be written to the disks 480a, 480b, 
480c. 

|p The video distribution system as shown in Fig. 4 illustrates a system having local 

cluster networks 455, and 460, and the global communication network 415. It is apparent 
y that the sen/er clusters 410a and 410b do not require the diuster networks 455 and 460 to 

virtually construct the server clusters 410a and 410b. Further, the disks 480a, 480r 
M= may be grouped in such fashion that they can be associated with one or more of the 
ftj5 server systems 420a, 420f. The generalized structure allows the configuration server 
t;; 485 to allocate the functions of the system to any of the server systems 420a, 4201 
For instance the admission server 435 and the gateway server 475 may in fact be the 
same computing system and additionally, may be one of the sen/er systems 420a, 
420f . Also, any of the edge servers 405a, 405b, or 405c may physically be on of the 
20 server systems 420a, 420f. 

The segments of the video data files 490a are shown as distributed over multiple 
disks 480a, 480b, and 480c, associated with the server system 420a. Depending on the 
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file usage factors, and the interactivity factors, various segments or copies of segments 
490a, 490h may be placed at otlier server systems 420a, 420f, on the admission 
server 435, the configuration server 485, or even an edge server 405a, 405b, or 405c. 
The distribution of the segments 490a, 490h allows the balancing of the loading (the 
5 amount of data being transferred) of the disks 480a, 480r and disks 495a, 495w. 
The admission server 435 controls the placement of the segments and sub-segments and 
will eliminate segments of video data file based on a policy that will erase those segments 
that are least recently used, starting at the end of a video data file. Thus certain video 
data files may have a low number of segments present on the disks 480a, 480r of the 
30 server systems 420a, 420f. A request for a video data file having segments missing 
^ requires that the distribution server 470 recreate the segments of the video data file 
fz. requested and transfer them to the server systems 420a, ,1., 420f. However, those video 
% data file segments at the beginning of the video data file can be transferred to the client 

system 400a, 400b, 400c for viewing, while the distribution server 470 is recreating those 
ftS missing segments. 

The file usage factor as cited above is a measure of demand for a video disk file. It 
is indicative of such factors as the number of requests for a given video data file, the 
timing of the requests over a period of time, and the geographical distribution of the 
20 requests. The interactivity factor, as cited above, is a measure of how often a user 

causes an interruption of the processing of the video data files to replay a portion of the 
video data file, accelerate playing of the video data file (fast fonA/ard), skip a portion of the 
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video data file, or pause a particular frame of the video data file. These functions are 
similar to the functions performed by a video cassette recorder (VCR). 

As described above, if the video data file is transferred as a whole file, the whole 
video data file must be present at the client system 400a, 400b, 400c in order for a user to 
begin viewing of the video data file. This latency degrades the quality of service 
experienced by the user. Thus, the segmenting is advantageous. However, if the video 
data files is relatively small segmenting is not needed. Therefore, the size of the video 
data file is a factor cited above in determining the segment size. 

The load or the amount of data being transferred to or from an individual disks 
480a, 480r and 495a, 495w is allocated between a Vead action (transferring the 
video data file isochronously to a client system 400a, 400b, 400c for viewing by a user), a 
write action (transferring the video data file to a disk 480a, 480r and 495a, 495w), 
or a copy action (a disk to disk transfer of the video data file). The total bandwidth or 
transfer rate for a single disk is thus divided in the read action, the write action, or the 
copy action. The load of the is the amount of the total bandwidth consumed for the 
transfer of the requested video data files resident on the disk. Therefore, the segment 
size is determined by the number of disks 480a, 480r and 495a, 495w available to 
contain the video data file (some maybe off line or too full to accept the video data file) 
and the loading of the available disks. 
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It is well known in the art that while the above describes a system to distribute 
video data files to client systems, the apparatus is implemented as a program code for 
execution on a computing system. The program code maybe obtained from media such 
as storage nodes of the cluster network or the global communication network, or stored on 
5 storage media such a read only memory (ROM), or a magnetic disk. The program code 
executed by the computing system executes the method for segmenting video data files to 
facilitate the transfer of the video data files. The program executed is as described in Fig. 
6. 

0 While this invention has been particularly shown and described with reference to 

the preferred embodiments thereof, it will be understood by those skilled in the art that 
various changes in form and details may be made without departing frcm the spirit and 
scope of the invention. 

15 The invention claimed is: 
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